Description of Files
--------------------
SelfEnumerationdo_rep.do contains the code necessary to replicate all results in Nanes & Haim, "Self-Administered Field Surveys on Sensitive Topics" (2020). 

The main dataset necessary to run this .do file is "NanesandHaim JEPS Replication.dta". Appendix Table 1 is produced using the dataset "NanesandHaim JEPS Pilot Rep.dta". 

Description of Variables
------------------------
psgc - a unique identifier code for each "barangay" (local administrative unit) in our survey sample. Codes were merged from the official list of Philippines psgc codes based on enumerator-coded indicators of the respondent's barangay.

age - respondent age in years at the time of the survey

education - 0: elementary or less, 1: some high school, 2: completed high school, 3: some college, 4: completed college, 5: graduate school or more

income - weekly household income in Philippine Pesos (PHP)

sensitivecat - respondent treatment category (direct question, self-enumerated or randomized response). Treatment categories were randomized using the survey software we used on the tablets (ISurvey). 

enumerator - first name of the enumerator who conducted the survey

male - respondent gender; 0=female, 1=male

crowd - dummy variable indicating whether an onlooker was present at any time during the survey. See the online appendix for more details on how enumerators were instructed to code this variable.

hsdirect - Answer to the placebo question "did you attend high school" for the group of respondents who were asked direct questions.

hsself - Answer to the placebo question "did you attend high school" for the group of respondents who were asked self-enumerated questions.

hsrr - Answer to the placebo question "did you attend high school" for the group of respondents who were asked randomized response questions.

reportdir - Answer to the sensitive question for the group of respondents who were asked direct questions.

reportself - Answer to the sensitive question for the group of respondents who were asked self-enumerated questions. 

reportrr - Answer to the sensitive question for the group of respondents who were asked randomized response questions.

reportNPA - Answer to the sensitive question among respondents who answered directly or using self enumeration (0="no", 1="yes")

claimHS - Answer to the placebo question among respondents who answered directly or using self enumeration (0="no", 1="yes")

selfenum - Dummy variable indicating that the respondent answered the sensitive and placebo questions using self enumeration (0="no", 1="yes")

randomresp - Dummy variable indicating that the respondent answered the sensitive and placebo questions using random response / forced choice (0="no", 1="yes")

placer - Dummy variable indicating that the respondent declined to answer the placebo question (1=non-response, 0=provided a response)

selfenum_crowd - Interaction between selfenum x crowd

date - date the survey was conducted (start time)

time - time the survey was conducted (start time)

datend - date the survey was conducted (end time)

Data Processing
---------------
Survey data were originally downloaded directly from the online ISurvey website. The data were then cleaned using an R script and all PII was removed.